ON PATTERN OCCURRENCES IN A RANDOM TEXTApril

نویسندگان

  • Ioannis Fudos
  • Evaggelia Pitoura
  • Wojciech Szpankowski
چکیده

Consider a given pattern H and a random text T of length n. We assume that symbols in the text occur independently, and various symbols have diierent probabilities of occurrence (i.e., the so called asymmetric Bernoulli model). We are concerned with the probability of exactly r occurrences of H in the text T. We derive the generating function of this probability, and show that asymptotically it behaves as n r n?r?1 H , where is an explicitly computed constant, and H < 1 is the root of an equation depending on the structure of the pattern. We then extend these ndings to random patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Pattern Occurrences in a Random Text

Consider a given pattern H and a random text T of length n. We assume that symbols in the text occur independently, and various symbols have different probabilities of occurrence (l.e., the so called asymmetric Bernoulli modeQ. We are concerned with the probability of exactly T occurrences of H in the text T. We derive the generating function of this probability, and show that asymptotically it...

متن کامل

2 1 N ov 2 00 5 Berry Esseen Bounds for Combinatorial Central Limit Theorems and Pattern Occurrences , using Zero and Size Biasing ∗ † Larry Goldstein University of Southern California

Berry Esseen type bounds to the normal, based on zeroand size-bias couplings, are derived using Stein’s method. The zero biasing bounds are illustrated with an application to combinatorial central limit theorems where the random permutation has either the uniform distribution or one which is constant over permutations with the same cycle type and having no fixed points. The size biasing bounds ...

متن کامل

Berry–esseen Bounds for Combinatorial Central Limit Theorems and Pattern Occurrences, Using Zero and Size Biasing

Berry–Esseen-type bounds to the normal, based on zeroand size-bias couplings, are derived using Stein’s method. The zero biasing bounds are illustrated in an application to combinatorial central limit theorems in which the random permutation has either the uniform distribution or one that is constant over permutations with the same cycle type, with no fixed points. The size biasing bounds are a...

متن کامل

Pattern Discovery as Event Association

A basic task of machine learning and data mining is to automatically uncover patterns that reflect regularities in a data set. When dealing with a large database, especially when domain knowledge is not available or very weak, this can be a challenging task. The purpose of pattern discovery is to find non-random relations among events from data sets. For example, the “exclusive OR” (XOR) proble...

متن کامل

Frequency of Pattern Occurences in a (DNA) Sequence

Consider a given pattern H and a random text T oflength n. We assltme that consecutive symbols in the texl are generated either independently or with a Markovian dependency, i.e., we stItely both the so called Bernoulli model and the Markovian model. OUf goal is to assess the limiting distribution of the frequency of the pattern occurrences ln a random sequence. Overlapping copies of a pattern ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996